ABSTRACT


In this paper, it is shown that the upper and lower bounds of the errors
in the Newton iterates recently obtained by Potra-Pták [11] and Miel [7], with
the use of nondiscrete induction and majorizing sequences, respectively, follow
immediately from the Kantorovich theorem and the Kantorovich recurrence
relations. It is also shown that the upper and lower bounds of Miel are
sharper than those of Potra-Pták.
SIGNIFICANCE  AND  EXPLANATION


Finding precise error bounds for iterative solutions of equations is one
of the important subjects in numerical analysis. This paper shows that the
upper and lower bounds of the errors in the Newton iterates recently obtained
by Potra-Pták [11] and Miel [7] follow from the Kantorovich theorem, and that
the bounds of Miel are sharper than those of Potra-Pták.


The  responsibility  for  the  wording  and  views  expressed  in  this  descriptive 
summary  lies  with  MRC,  and  not  with  the  author  of  this  report. 


ERROR  BOUNDS  FOR  NEWTON'S  ITERATES  DERIVED  FROM  THE  KANTOROVICH  THEOREM 


Tetsuro  Yamamoto 


1.  Introduction 


The Kantorovich theorem for the Newton method is of fundamental importance
in the study of nonlinear equations in Euclidean and Banach spaces. Let $X$ and
$Y$ be Banach spaces, $D$ be an open convex subset of $X$ and
$F : D \subset X \to Y$ be a Fréchet differentiable operator whose derivative
satisfies a Lipschitz condition in $D$. Then the theorem guarantees the
existence and uniqueness of a solution of the equation $F(x) = 0$ and the
convergence of the Newton process to the solution.

By replacing the original assumption that $F$ belongs to the $C^2$-class in $D$
by the weaker one of Lipschitz continuity of $F'$ in $D$, an affine invariant
version of the theorem is stated as follows:

Theorem 1 (Kantorovich-Akilov [4; Theorem 6 (1.XVIII)]). Let
$F : D \subset X \to Y$ be Fréchet differentiable. Assume that, at some
$x_0 \in D$, $F'(x_0)$ is invertible and that

$$\|F'(x_0)^{-1}(F'(x) - F'(y))\| \le K\|x - y\|, \quad x, y \in D, \tag{1}$$

$$\|F'(x_0)^{-1}F(x_0)\| \le \eta, \quad h = K\eta \le 1/2, \tag{2}$$

$$\bar S(x_0, t^*) \subset D, \quad t^* = 2\eta/(1 + \sqrt{1 - 2h}). \tag{3}$$

Then:

i) The Newton iterates $x_{n+1} = x_n - F'(x_n)^{-1}F(x_n)$, $n \ge 0$, are
well-defined, lie in $\bar S(x_0, t^*)$ and converge to a solution $x^*$ of
$F(x) = 0$.

ii) The solution $x^*$ is unique in $S(x_0, t^{**}) \cap D$,
$t^{**} = (1 + \sqrt{1 - 2h})/K$ if $2h < 1$, and in $\bar S(x_0, t^{**})$ if
$2h = 1$.

iii) Error estimates

$$\|x^* - x_n\| \le \begin{cases} t^* & (n = 0), \\ 2^{1-n}(2h)^{2^n - 1}\,\eta & (n \ge 1) \end{cases} \tag{4}$$

are valid.
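As a numerical illustration of Theorem 1, the following Python sketch evaluates
h, t* and the a priori bounds (4) for a scalar test problem. The problem
f(x) = x^3 - 2 on D = (1, 2) with x_0 = 1.3, the Lipschitz constant 12 of f' on
D, and all function names are illustrative assumptions, not taken from the
paper.

```python
import math

def kantorovich_constants(f, df, x0, lipschitz):
    """Return (K, eta, h, t_star) of Theorem 1 for a scalar equation.

    `lipschitz` is an assumed Lipschitz constant of f' on D, so that
    K = lipschitz / |f'(x0)| satisfies the affine invariant condition (1).
    """
    K = lipschitz / abs(df(x0))
    eta = abs(f(x0) / df(x0))
    h = K * eta
    if h > 0.5:
        raise ValueError("condition (2) fails: h = K * eta > 1/2")
    t_star = 2.0 * eta / (1.0 + math.sqrt(1.0 - 2.0 * h))
    return K, eta, h, t_star

def apriori_bound(n, eta, h, t_star):
    """Right-hand side of the error estimate (4)."""
    if n == 0:
        return t_star
    return 2.0 ** (1 - n) * (2.0 * h) ** (2 ** n - 1) * eta

def newton(f, df, x0, steps):
    """Return the Newton iterates x_0, ..., x_steps."""
    xs = [x0]
    for _ in range(steps):
        xs.append(xs[-1] - f(xs[-1]) / df(xs[-1]))
    return xs

# Illustrative test problem (an assumption, not from the paper).
f = lambda x: x ** 3 - 2.0
df = lambda x: 3.0 * x ** 2
x0, x_star = 1.3, 2.0 ** (1.0 / 3.0)

K, eta, h, t_star = kantorovich_constants(f, df, x0, lipschitz=12.0)
xs = newton(f, df, x0, 3)
for n, x in enumerate(xs):
    # columns: n, true error, a priori bound (4)
    print(n, abs(x_star - x), apriori_bound(n, eta, h, t_star))
```

Only three Newton steps are taken because, in double precision, the true error
reaches rounding level soon afterwards and the comparison stops being
meaningful.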

Remark 1. The importance of such an affine invariant formulation is stressed in
Deuflhard-Heindl [2].

Remark 2. The condition (3) may be replaced by the weaker conditions

$$x_0 \in D, \quad \bar S(x_1, t^* - \eta) \subset D, \tag{5}$$

which are due to Schmidt [14]. In fact, by induction on $n$ and the well-known
majorant principle for the Newton iterates, we can prove that, under the
assumptions (1), (2) and (5), the Newton iterates are well-defined and
$x_n \in \bar S(x_1, t^* - \eta)$, $n \ge 1$.

There is an extensive literature ([1], [3], [5]-[7], [9]-[11], [16]) on
improvements of the estimates (4). For example, under the assumptions of
Theorem 1 (or with (3) replaced by (5)), the following results hold.

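Theorem 2 (Gragg-Tapia [3]).

$$\|x^* - x_n\| \le \begin{cases} \dfrac{2\sqrt{1 - 2h}}{h}\,\dfrac{\theta^{2^n}}{1 - \theta^{2^n}}\,\|x_1 - x_0\| & \text{if } 2h < 1, \\[2ex] 2^{1-n}\,\|x_1 - x_0\| & \text{if } 2h = 1, \end{cases} \qquad n \ge 0, \tag{6}$$

and

$$\frac{2\|x_{n+1} - x_n\|}{1 + \sqrt{1 + \dfrac{4\theta^{2^n}}{(1 + \theta^{2^n})^2}}} \le \|x^* - x_n\| \le \theta^{2^{n-1}}\|x_n - x_{n-1}\|, \qquad n \ge 1, \tag{7}$$

where $\theta = t^*/t^{**} = (1 - \sqrt{1 - 2h})/(1 + \sqrt{1 - 2h})$.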


if $2h < 1$, and

$$2^{1-n}\Bigl(\sqrt{1 + \frac{2^n}{\eta}\|x_{n+1} - x_n\|} - 1\Bigr)\eta \le \|x^* - x_n\| \le \frac{2^{n-1}}{\eta}\|x_n - x_{n-1}\|^2 \tag{10}$$

if $2h = 1$.

Remark 3. Let $f(t) = \frac{1}{2}Kt^2 - t + \eta$ and define the sequence
$\{t_n\}$ by

$$t_0 = 0, \quad t_{n+1} = t_n - f(t_n)/f'(t_n), \quad n \ge 0.$$

Then the well-known majorant principle due to Kantorovich asserts that

$$\|x^* - x_n\| \le t^* - t_n, \quad n \ge 0. \tag{11}$$

More general arguments are developed in Ortega-Rheinboldt [9] and Schmidt
[14], [15]. On the basis of Ostrowski's results [10; Appendix F], however, we
can show that the bounds (11) are the same as (6), provided that
$\eta = \|x_1 - x_0\|$. Furthermore, we note that the upper bounds in Theorems
2-4 coincide for $n = 1$, and are equal to
$t^* - t_1 = (1 - h - \sqrt{1 - 2h})/K$ with $\eta = \|x_1 - x_0\|$.
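The coincidence of (11) and (6) stated in Remark 3 can be checked numerically.
The sketch below generates the majorizing sequence {t_n} and compares
t* - t_n with the right-hand side of (6) for 2h < 1. The constants K and eta
correspond to the illustrative problem f(x) = x^3 - 2, x_0 = 1.3, D = (1, 2);
this problem and the function names are assumptions, not part of the paper.

```python
import math

def majorizing_sequence(K, eta, steps):
    """Newton's method applied to f(t) = K*t**2/2 - t + eta from t_0 = 0."""
    ts = [0.0]
    for _ in range(steps):
        t = ts[-1]
        ts.append(t - (0.5 * K * t * t - t + eta) / (K * t - 1.0))
    return ts

def gragg_tapia_upper(n, K, eta):
    """Right-hand side of (6) for 2h < 1, taking eta = |x_1 - x_0|."""
    h = K * eta
    s = math.sqrt(1.0 - 2.0 * h)
    theta = (1.0 - s) / (1.0 + s)
    q = theta ** (2 ** n)
    return 2.0 * s / h * q / (1.0 - q) * eta

# Illustrative constants (assumed): f(x0) = 0.197, f'(x0) = 5.07, Lipschitz 12.
K, eta = 12.0 / 5.07, 0.197 / 5.07
t_star = 2.0 * eta / (1.0 + math.sqrt(1.0 - 2.0 * K * eta))
ts = majorizing_sequence(K, eta, 3)
for n, t in enumerate(ts):
    # columns: n, t* - t_n from (11), right-hand side of (6)
    print(n, t_star - t, gragg_tapia_upper(n, K, eta))
```

The two columns should agree up to rounding error, which is consistent with
the remark when eta is taken equal to |x_1 - x_0|.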

Theorem 2 was derived with the use of the Kantorovich recurrence relations.
Theorems 3 and 4, improved versions of Theorem 2, were obtained recently by
nondiscrete induction and the majorizing sequence, respectively. In [7], Miel
has mentioned that the upper bounds in (9) turn out to be sharper than those
in (8), and that numerical experiments also indicate that the lower bounds in
(9) are finer than those in (8). In this paper, with the use of the same
technique as in the previous paper [17], we shall show that Theorems 3 and 4
follow immediately from Theorem 1 and that Theorem 4 improves Theorem 3.

2.  Basic  Lemmas

To derive Theorems 3 and 4 from Theorem 1, we need the following basic
lemmas, whose proofs we give for the sake of completeness.

Lemma 1. Under the assumptions of Theorem 1, define three sequences
$\{B_n\}$, $\{\eta_n\}$ and $\{h_n\}$ by

$$B_0 = 1, \quad B_n = \frac{B_{n-1}}{1 - h_{n-1}},$$

$$\eta_0 = \|x_1 - x_0\|, \quad \eta_n = \frac{h_{n-1}\eta_{n-1}}{2(1 - h_{n-1})},$$

$$h_0 = h = K\eta, \quad h_n = KB_n\eta_n = \frac{h_{n-1}^2}{2(1 - h_{n-1})^2}, \qquad n = 1, 2, \ldots,$$

respectively. Then we have

$$\|F'(x_n)^{-1}F'(x_0)\| \le B_n \quad \text{and} \quad \|F'(x_n)^{-1}F(x_n)\| \le \eta_n.$$

In particular, if $2h = 1$, then $2h_n = 1$ and
$\eta_n = 2^{-1}\eta_{n-1} = 2^{-n}\eta_0$.

Proof. This is a direct application of the original recurrence relations to
$F'(x_0)^{-1}F$ (cf. Rall [13]). Q.E.D.
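The recurrences of Lemma 1 are straightforward to evaluate. The following
sketch computes B_n, eta_n and h_n and compares them with the quantities they
bound along an actual Newton iteration. The scalar problem f(x) = x^3 - 2 with
x_0 = 1.3, D = (1, 2) and Lipschitz constant 12 for f' is an illustrative
assumption, and eta is taken equal to |x_1 - x_0|.

```python
def kantorovich_recurrence(K, eta0, steps):
    """Sequences B_n, eta_n, h_n of Lemma 1 for n = 0, ..., steps."""
    B, eta, h = [1.0], [eta0], [K * eta0]
    for _ in range(steps):
        B_next = B[-1] / (1.0 - h[-1])
        eta_next = h[-1] * eta[-1] / (2.0 * (1.0 - h[-1]))
        B.append(B_next)
        eta.append(eta_next)
        h.append(K * B_next * eta_next)
    return B, eta, h

# Illustrative test problem (an assumption, not from the paper).
f = lambda x: x ** 3 - 2.0
df = lambda x: 3.0 * x ** 2
xs = [1.3]
for _ in range(4):
    xs.append(xs[-1] - f(xs[-1]) / df(xs[-1]))

K = 12.0 / abs(df(xs[0]))      # affine invariant Lipschitz constant in (1)
eta0 = abs(xs[1] - xs[0])      # eta_0 = |x_1 - x_0|
B, eta, h = kantorovich_recurrence(K, eta0, 3)
for n in range(4):
    # columns: n, |f'(x_n)^{-1} f'(x_0)|, B_n, |x_{n+1} - x_n|, eta_n
    print(n, abs(df(xs[0]) / df(xs[n])), B[n], abs(xs[n + 1] - xs[n]), eta[n])
```

In each row the second entry should not exceed the third, and the fourth
should not exceed the fifth, as the lemma asserts.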

Lemma 2. The speed of convergence of the iterates is estimated by

$$\|x^* - x_n\| \le \frac{KB_{n-1}}{2}\,\|x^* - x_{n-1}\|^2. \tag{12}$$

Similarly we have

$$\|x_{n+1} - x_n\| \le \frac{KB_n}{2}\,\|x_n - x_{n-1}\|^2. \tag{13}$$

Proof. The estimates (12) follow from (1), Lemma 1 and the relations

$$x^* - x_n = -F'(x_{n-1})^{-1}\{F(x^*) - F(x_{n-1}) - F'(x_{n-1})(x^* - x_{n-1})\}$$

$$= -F'(x_{n-1})^{-1}F'(x_0) \int_0^1 F'(x_0)^{-1}\{F'(x_{n-1} + t(x^* - x_{n-1})) - F'(x_{n-1})\}(x^* - x_{n-1})\,dt.$$

Similarly, the estimates (13) follow from the relations

$$x_{n+1} - x_n = -F'(x_n)^{-1}F(x_n)$$

$$= -F'(x_n)^{-1}\Bigl[F(x_{n-1}) + F'(x_{n-1})(x_n - x_{n-1}) + \int_0^1 \{F'(x_{n-1} + t(x_n - x_{n-1})) - F'(x_{n-1})\}(x_n - x_{n-1})\,dt\Bigr]$$

$$= -F'(x_n)^{-1}F'(x_0)\Bigl[\int_0^1 F'(x_0)^{-1}\{F'(x_{n-1} + t(x_n - x_{n-1})) - F'(x_{n-1})\}(x_n - x_{n-1})\,dt\Bigr].$$

Q.E.D.

Lemma 3 (Basic Error Estimates). Under the assumptions of Theorem 1, we have

$$\frac{2\|x_{n+1} - x_n\|}{1 + \sqrt{1 + 2KB_n\|x_{n+1} - x_n\|}} \le \|x^* - x_n\| \le \frac{2\|x_{n+1} - x_n\|}{1 + \sqrt{1 - 2KB_n\|x_{n+1} - x_n\|}}. \tag{14}$$

Proof. Replace $x_0$ and $\eta$ in Theorem 1 by $x_n$ and
$\|x_{n+1} - x_n\|$, respectively. Then (1) is replaced by

$$\|F'(x_n)^{-1}(F'(x) - F'(y))\| \le \|F'(x_n)^{-1}F'(x_0)\| \cdot \|F'(x_0)^{-1}(F'(x) - F'(y))\| \le KB_n\|x - y\|, \quad x, y \in D.$$

Therefore, the upper bounds in (14) follow from Theorem 1. Furthermore, we have
from Lemma 2

$$\|x_{n+1} - x_n\| \le \|x_{n+1} - x^*\| + \|x^* - x_n\| \le \frac{KB_n}{2}\|x^* - x_n\|^2 + \|x^* - x_n\|,$$

or

$$\frac{KB_n}{2}\|x^* - x_n\|^2 + \|x^* - x_n\| - \|x_{n+1} - x_n\| \ge 0.$$

Solving this yields the lower bounds in (14). Q.E.D.
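The estimates (14) are computable from two successive iterates once B_n is
known. The sketch below evaluates both sides of (14) and checks that they
bracket the true error. As before, the test problem f(x) = x^3 - 2, x_0 = 1.3,
D = (1, 2), the Lipschitz constant 12 and the function names are illustrative
assumptions; B_n is advanced by the recurrence of Lemma 1 with
eta = |x_1 - x_0|.

```python
import math

def lemma3_bounds(K, B_n, e_n):
    """Lower and upper bounds (14) for |x* - x_n|, with e_n = |x_{n+1} - x_n|."""
    r = 2.0 * K * B_n * e_n
    lower = 2.0 * e_n / (1.0 + math.sqrt(1.0 + r))
    upper = 2.0 * e_n / (1.0 + math.sqrt(1.0 - r))
    return lower, upper

# Illustrative test problem (an assumption, not from the paper).
f = lambda x: x ** 3 - 2.0
df = lambda x: 3.0 * x ** 2
x_star = 2.0 ** (1.0 / 3.0)
xs = [1.3]
for _ in range(3):
    xs.append(xs[-1] - f(xs[-1]) / df(xs[-1]))

K = 12.0 / abs(df(xs[0]))
B, eta = 1.0, abs(xs[1] - xs[0])
h = K * eta
results = []
for n in range(3):
    e_n = abs(xs[n + 1] - xs[n])
    lower, upper = lemma3_bounds(K, B, e_n)
    results.append((lower, abs(x_star - xs[n]), upper))
    # advance B_n, eta_n, h_n by the recurrence of Lemma 1
    B, eta = B / (1.0 - h), h * eta / (2.0 * (1.0 - h))
    h = K * B * eta

for n, (lower, err, upper) in enumerate(results):
    print(n, lower, err, upper)
```

For n = 0 the upper bound reduces to t* of Theorem 1, since B_0 = 1 and
e_0 = eta in this setting.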

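Lemma 4. We have

$$B_n\sqrt{1 - 2h} = \sqrt{1 - 2h_n}, \quad n \ge 0, \tag{15}$$

and

$$B_n^{-1} = \sqrt{1 - 2h + (K\eta_{n-1})^2}, \quad n \ge 1. \tag{16}$$

Proof. The equalities (15) and (16) are trivial for $n = 0$ and $n = 1$,
respectively. If $n \ge 1$, then we obtain

$$1 - 2h_n = 1 - \frac{h_{n-1}^2}{(1 - h_{n-1})^2} = \frac{1 - 2h_{n-1}}{(1 - h_{n-1})^2} = \cdots = \frac{1 - 2h_0}{(1 - h_{n-1})^2 (1 - h_{n-2})^2 \cdots (1 - h_0)^2} = (1 - 2h)B_n^2,$$

which proves (15). Furthermore, if $n \ge 2$, then we have

$$B_n^{-2} = \prod_{i=0}^{n-1}(1 - h_i)^2 = (1 - 2h_{n-1})\prod_{i=0}^{n-2}(1 - h_i)^2 + h_{n-1}^2\prod_{i=0}^{n-2}(1 - h_i)^2$$

$$= 1 - 2h + \Bigl\{KB_{n-1}\eta_{n-1}\prod_{i=0}^{n-2}(1 - h_i)\Bigr\}^2 = 1 - 2h + (K\eta_{n-1})^2,$$

since $B_{n-1}\prod_{i=0}^{n-2}(1 - h_i) = 1$. This proves (16). Q.E.D.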
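The identities (15) and (16) can be sanity-checked in floating point. The
sketch below iterates the recurrence of Lemma 1 and records the largest
deviation from each identity. The constants K and eta_0 are illustrative
assumptions (they correspond to f(x) = x^3 - 2, x_0 = 1.3), with eta taken
equal to eta_0.

```python
import math

def lemma4_residuals(K, eta0, steps):
    """Largest absolute residuals of (15) and (16) for n = 0, ..., steps."""
    h0 = K * eta0
    B, eta, h = 1.0, eta0, h0
    eta_prev = None
    res15, res16 = 0.0, 0.0
    for n in range(steps + 1):
        # (15): B_n * sqrt(1 - 2h) = sqrt(1 - 2h_n)
        res15 = max(res15, abs(B * math.sqrt(1.0 - 2.0 * h0)
                               - math.sqrt(1.0 - 2.0 * h)))
        if n >= 1:
            # (16): 1 / B_n = sqrt(1 - 2h + (K * eta_{n-1})^2)
            rhs = math.sqrt(1.0 - 2.0 * h0 + (K * eta_prev) ** 2)
            res16 = max(res16, abs(1.0 / B - rhs))
        eta_prev = eta
        B, eta = B / (1.0 - h), h * eta / (2.0 * (1.0 - h))
        h = K * B * eta
    return res15, res16

# Illustrative constants (assumed): f(x0) = 0.197, f'(x0) = 5.07, Lipschitz 12.
res15, res16 = lemma4_residuals(12.0 / 5.07, 0.197 / 5.07, 5)
print(res15, res16)
```

Both residuals should be at the level of rounding error.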
i-0 




3.  Proof  of  Theorem  3 


Let us now prove Theorem 3. We put $e_n = \|x_{n+1} - x_n\|$. Then, by
Lemma 4, we have

$$B_n^{-1} = \sqrt{1 - 2h + (K\eta_{n-1})^2} \ge \sqrt{1 - 2h + (Ke_{n-1})^2}.$$

Hence we obtain from Lemmas 2-4

$$\|x^* - x_n\| \le \frac{2e_n}{1 + \sqrt{1 - 2KB_ne_n}} \le \frac{2e_n}{1 + \sqrt{1 - 2KB_n\eta_n}} \le \frac{KB_ne_{n-1}^2}{1 + \sqrt{1 - 2h_n}}$$

$$= \frac{Ke_{n-1}^2}{B_n^{-1} + \sqrt{1 - 2h}} \le \frac{Ke_{n-1}^2}{\sqrt{1 - 2h + (Ke_{n-1})^2} + \sqrt{1 - 2h}} = \sqrt{a^2 + e_{n-1}^2} - a,$$

where $a = \sqrt{1 - 2h}/K$. Next, to derive the lower bounds in (8), we
observe that
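by Lemma 4

$$\sqrt{a^2 + \eta_{n-1}^2} = \frac{1}{K}\sqrt{1 - 2h + (K\eta_{n-1})^2} = \frac{1}{KB_n},$$

so that

$$e_n \le \frac{1}{2}KB_ne_{n-1}^2 = \frac{e_{n-1}^2}{2\sqrt{a^2 + \eta_{n-1}^2}} \le \frac{e_{n-1}^2}{2\sqrt{a^2 + e_{n-1}^2}},$$

or

$$e_{n-1}^2 \ge 2e_n\sqrt{a^2 + e_{n-1}^2},$$

which is equivalent to

$$\bigl(\sqrt{a^2 + e_{n-1}^2} - e_n\bigr)^2 \ge a^2 + e_n^2. \tag{17}$$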


Moreover we have

$$\sqrt{a^2 + e_{n-1}^2} \ge e_n, \quad n = 1, 2, \ldots. \tag{18}$$

In fact, the inequalities (18) follow from the inequalities

$$e_n \le \frac{KB_n}{2}e_{n-1}^2 \le \frac{h_{n-1}}{2(1 - h_{n-1})}e_{n-1} \le \frac{1}{2}e_{n-1} \le e_{n-1}.$$

Hence we have from (17)

$$\sqrt{a^2 + e_{n-1}^2} \ge e_n + \sqrt{a^2 + e_n^2}.$$


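Consequently we obtain from Lemma 3

$$\|x^* - x_n\| \ge \frac{2e_n}{1 + \sqrt{1 + 2KB_ne_n}} = \frac{2e_n}{1 + \sqrt{1 + 2e_n/\sqrt{a^2 + \eta_{n-1}^2}}} \ge \frac{2e_n}{1 + \sqrt{1 + 2e_n/\sqrt{a^2 + e_{n-1}^2}}}$$

$$\ge \frac{2e_n}{1 + \sqrt{1 + 2e_n/(e_n + \sqrt{a^2 + e_n^2})}}$$

$$= \frac{2e_n(e_n + \sqrt{a^2 + e_n^2})}{e_n + \sqrt{a^2 + e_n^2} + \sqrt{(e_n + \sqrt{a^2 + e_n^2})(3e_n + \sqrt{a^2 + e_n^2})}} = Y(e_n).$$

This completes the proof of Theorem 3.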
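The two bounds derived in this proof can be evaluated directly from successive
iterates and the constant a. The sketch below computes the upper bound
sqrt(a^2 + e_{n-1}^2) - a and the lower bound Y(e_n) and checks them against
the true error. The test problem f(x) = x^3 - 2, x_0 = 1.3, D = (1, 2), the
Lipschitz constant 12 and the function names are illustrative assumptions,
with eta = |x_1 - x_0|.

```python
import math

def potra_ptak_upper(a, e_prev):
    """sqrt(a^2 + e_{n-1}^2) - a, written in a cancellation-free form."""
    return e_prev ** 2 / (math.sqrt(a * a + e_prev ** 2) + a)

def potra_ptak_lower(a, e_n):
    """Lower bound Y(e_n) in the form 2 e_n / (1 + sqrt(1 + 2 e_n / s))."""
    s = e_n + math.sqrt(a * a + e_n ** 2)
    return 2.0 * e_n / (1.0 + math.sqrt(1.0 + 2.0 * e_n / s))

# Illustrative test problem (an assumption, not from the paper).
f = lambda x: x ** 3 - 2.0
df = lambda x: 3.0 * x ** 2
x_star = 2.0 ** (1.0 / 3.0)
xs = [1.3]
for _ in range(3):
    xs.append(xs[-1] - f(xs[-1]) / df(xs[-1]))

K = 12.0 / abs(df(xs[0]))
h = K * abs(xs[1] - xs[0])
a = math.sqrt(1.0 - 2.0 * h) / K
e = [abs(xs[n + 1] - xs[n]) for n in range(3)]

table = [(n, potra_ptak_lower(a, e[n]), abs(x_star - xs[n]),
          potra_ptak_upper(a, e[n - 1])) for n in (1, 2)]
for row in table:
    # columns: n, lower bound, true error, upper bound
    print(*row)
```

The upper bound is evaluated as e^2 / (sqrt(a^2 + e^2) + a), which equals
sqrt(a^2 + e^2) - a algebraically but avoids subtracting nearly equal numbers
when e is small.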


4.  Proof  of  Theorem  4 


To prove Theorem 4, we use Gragg-Tapia's result

$$\sqrt{1 - 2h_n} = \frac{1 - \theta^{2^n}}{1 + \theta^{2^n}}. \tag{19}$$


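If $2h < 1$, then we have from (19)

$$B_n = \frac{\sqrt{1 - 2h_n}}{\sqrt{1 - 2h}} = \frac{1}{\sqrt{1 - 2h}} \cdot \frac{1 - \theta^{2^n}}{1 + \theta^{2^n}} = \frac{2}{K\Delta} \cdot \frac{1 - \theta^{2^n}}{1 + \theta^{2^n}}, \tag{20}$$

where $\Delta = t^{**} - t^*$. If $2h = 1$, then obviously we have
$B_n = 2^n$ and

$$\frac{2e_n}{1 + \sqrt{1 + 2KB_ne_n}} = \frac{2e_n}{1 + \sqrt{1 + 2^ne_n/\eta}}.$$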

Therefore the lower bounds in Theorem 4 coincide with those in Lemma 3.
Furthermore, we obtain from (20) and (19)

$$\frac{2}{1 + \sqrt{1 - 2h_n}} = 1 + \theta^{2^n}, \qquad \frac{KB_n}{1 + \sqrt{1 - 2h_n}} = \frac{1 - \theta^{2^n}}{\Delta}$$

if $2h < 1$. Hence the upper bounds in (9) reduce to


$$\|x^* - x_n\| \le \frac{KB_ne_{n-1}^2}{1 + \sqrt{1 - 2h_n}} = \frac{Ke_{n-1}^2}{B_n^{-1} + \sqrt{1 - 2h}}, \tag{21}$$

which follow immediately from Lemmas 2 and 3. The upper bounds in (10) also
reduce to (21) because $B_n = 2^n$ and $K = 1/(2\eta)$ if $2h = 1$. The proof
is completed.


5.  Observations 

By our proofs of Theorems 3 and 4, we see that Potra-Pták's upper bounds
are obtained by replacing $B_n^{-1}$ in the last expression of (21) by the
smaller quantities $\sqrt{1 - 2h + (Ke_{n-1})^2}$. Similarly the lower bounds
$Y(e_n)$ in (8) are obtained if we replace $KB_n$ in the lower bounds in (14)
by the larger quantities $(e_n + \sqrt{a^2 + e_n^2})^{-1}$. Therefore we can
conclude that Theorem 4 is finer than Theorem 3. We remark also that
Gragg-Tapia's upper bounds in (6) and (7) are equal to


$$\tau_n = \frac{2\eta_n}{1 + \sqrt{1 - 2h_n}} \quad \text{and} \quad \frac{\tau_n}{\eta_{n-1}}\,e_{n-1}, \tag{22}$$


respectively (cf. Yamamoto [17]). As was shown in [11], the upper bounds
of Potra-Pták are sharper than (22). Furthermore, it is easy to see that the
lower bounds of Potra-Pták improve those of Gragg-Tapia in (7). (This fact
remains unproved in [11].) To prove this, we note that the lower bounds in (7)
may be written

$$\frac{2e_n}{1 + \sqrt{1 + 2h_n}}$$

and that

$$\frac{e_n}{e_n + \sqrt{a^2 + e_n^2}} \le \frac{\eta_n}{\eta_n + \sqrt{a^2 + \eta_n^2}} = \frac{\eta_n}{\eta_n + (KB_{n+1})^{-1}} = h_n,$$

where we have used the fact that the function
$g(t) = t/(t + \sqrt{a^2 + t^2})$ is monotonically increasing with respect to
$t$. Hence we obtain that

$$Y(e_n) = \frac{2e_n}{1 + \sqrt{1 + 2e_n(e_n + \sqrt{a^2 + e_n^2})^{-1}}} \ge \frac{2e_n}{1 + \sqrt{1 + 2h_n}}.$$
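The ordering of the bounds discussed in this section can be observed
numerically. The sketch below evaluates, for n = 1 and n = 2, the upper bounds
from (7), from the proof of Theorem 3 and from (21), together with the
corresponding lower bounds (the lower bound of (7) in the form
2e_n/(1 + sqrt(1 + 2h_n)), Y(e_n), and the lower bound of (14)). The labels
"miel" refer to (21) and (14), which the text identifies with the bounds of
Theorem 4. The test problem f(x) = x^3 - 2, x_0 = 1.3, D = (1, 2), the
Lipschitz constant 12 and all names are illustrative assumptions, with
eta = |x_1 - x_0|.

```python
import math

def bounds_at(n, xs, K):
    """Lower and upper bounds for |x* - x_n|, n >= 1, as dictionaries."""
    eta0 = abs(xs[1] - xs[0])
    h0 = K * eta0
    s0 = math.sqrt(1.0 - 2.0 * h0)
    a = s0 / K
    theta = (1.0 - s0) / (1.0 + s0)
    # B_n and h_n from the recurrence of Lemma 1
    B, eta, h = 1.0, eta0, h0
    for _ in range(n):
        B, eta = B / (1.0 - h), h * eta / (2.0 * (1.0 - h))
        h = K * B * eta
    e_prev = abs(xs[n] - xs[n - 1])
    e_n = abs(xs[n + 1] - xs[n])
    upper = {
        "gragg_tapia": theta ** (2 ** (n - 1)) * e_prev,
        "potra_ptak": e_prev ** 2 / (math.sqrt(a * a + e_prev ** 2) + a),
        "miel": K * B * e_prev ** 2 / (1.0 + math.sqrt(1.0 - 2.0 * h)),
    }
    s = e_n + math.sqrt(a * a + e_n ** 2)
    lower = {
        "gragg_tapia": 2.0 * e_n / (1.0 + math.sqrt(1.0 + 2.0 * h)),
        "potra_ptak": 2.0 * e_n / (1.0 + math.sqrt(1.0 + 2.0 * e_n / s)),
        "miel": 2.0 * e_n / (1.0 + math.sqrt(1.0 + 2.0 * K * B * e_n)),
    }
    return lower, upper

# Illustrative test problem (an assumption, not from the paper).
f = lambda x: x ** 3 - 2.0
df = lambda x: 3.0 * x ** 2
x_star = 2.0 ** (1.0 / 3.0)
xs = [1.3]
for _ in range(3):
    xs.append(xs[-1] - f(xs[-1]) / df(xs[-1]))
K = 12.0 / abs(df(xs[0]))

for n in (1, 2):
    lower, upper = bounds_at(n, xs, K)
    print(n, lower, abs(x_star - xs[n]), upper)
```

For n = 1 the three upper bounds agree up to rounding, in line with Remark 3;
differences between them appear from n = 2 onward.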